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Abstract 

We present a new and simple approach to concentration inequalities for 
functions around their expectation with respect to non-product measures, 
i.e., for dependent random variables. Our method is based on coupling 
ideas and does not use information inequalities. When one has a uniform 
control on the coupling, this leads to exponential concentration inequali¬ 
ties. When such a uniform control is no more possible, this leads to poly¬ 
nomial or stretched-exponential concentration inequalities. Our abstract 
results apply to Gibbs random fields, in particular to the low-temperature 
Ising model which is a concrete example of non-uniformity of the coupling. 
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1 Introduction 


By now, concentration inequalities for product measures have become a stan¬ 
dard and powerful tool in many areas of probability and statistics, such as 
density estimation [S], geometric probability m, etc. A recent monograph 
about this area is m where the reader can find much more information and 
relevant references. Exponential concentration inequalities for functions of 
dependent, strongly mixing random variables were obtained for instance in 
[TOl ITU fn^ fTHl [201 [10]. In the context of dynamical systems Collet et al. |0] 
obtained an exponential concentration inequality for separately Lipschitz func¬ 
tions using spectral analysis of the transfer operator. In m, C. Kiilske obtained 
an exponential concentration inequality for functions of Gibbs random fields in 
the Dobrushin uniqueness regime. Therein the main input is Theorem 8.20 in 
[O] which allows to estimate uniformly the terms appearing in the martingale 
difference decomposition in terms of the Dobrushin matrix. In US], K. Marton 
obtained exponential concentration results for a class of Gibbs random fields 
under a strong mixing condition lying between Dobrushin-Shlosman condition 
and its weakening in the sense of E. Olivieri, P. Picco and F. Martinelli. 

Besides exponential concentration inequalities, polynomial concentration in¬ 
equalities easily follow from upper bounds on moments. In the context of prod¬ 
uct measures, bounds on the variance are well-known mis]. In the context of 
dynamical systems, a bound on the variance is obtained in | 2 ]. 

The approach followed in miisiiiniiin] uses coupling ideas and informa¬ 
tion inequalities, such as Pinsker inequality. Such inequalities can only lead to 
exponential concentration inequalities. This can be understood easily since it 
is well-known ^ that there is equivalence between information inequalities and 
exponential inequalities on the Laplace transform, the latter yielding exponen¬ 
tial concentration inequalities by Chebychev’s inequality. 

The purpose of the present paper is to derive abstract bounds allowing 
to obtain not only exponential, but also polynomial and stretched-exponential 
concentration inequalities. In particular, this means that we do not use infor¬ 
mation inequalities. Going beyond the exponential case was motivated by the 
low-temperature Ising model which can not satisfy an exponential concentra¬ 
tion inequality for the magnetization. Here we obtain abstract concentration 
inequalities using a coupling approach. Our setting is (dependent) random 
variables indexed by Z'^, d > 1, and taking values in a finite alphabet. We 
are interested in obtaining concentration inequalities for “local” functions g 
around their expectation 'Kg in terms of their variations. The inter-dependence 
between random variables is measured by a “coupling matrix” which tells us 
how “well” one can couple in the far “future” if the “past” is given. If the cou¬ 
pling matrix can be uniformly controlled in the realization, then an exponential 
concentration inequality follows. If the coupling matrix cannot be controlled 
uniformly in the realization, then we typically obtain bounds for moments and 
for Luxembourg norms of <7 — Kg. In the former case this leads to polynomial 
concentration inequalities, in the latter case this gives stretched-exponential 
concentration inequalities. 

As a first application of our abstract inequalities, we obtain an exponen- 
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tial concentration inequality for Gibbs random fields in a “high-temperature” 
regime, complementary to the Dobrushin uniqueness regime studied in m- A 
second application is the “low-temperature” Ising model for which the coupling 
matrix cannot be uniformly controlled in the realization, and for which the 
previous methods PH EDI do not apply. We obtain polynomial, even stretched- 
exponential, concentration inequalities for the low-temperature Ising model. 
Let us mention that our concentration inequalities yield various non-trivial ap¬ 
plications which will be the subject of a forthcoming paper. 

The paper is organized as follows. In Section 2, we state and prove our 
abstract inequalities, first in the context of random fields indexed by Z, and 
next when the index set is Z*^, d > 2. Section 3 deals with high-temperature 
Gibbs measures and the low-temperature Ising model. 


2 Main results 


Let A be a finite set. Let g : —> M be a function of n-variables. An element a 

of the set is an infinite sequence drawn from A, i.e., a = (ui, (T 2 , ..., cxj,...) 
where Ui G A. With a slight abuse of notation, we also consider g as a function 
on which does not depend on for all k > n. 

A concentration inequality is an estimate for the probability of concentration 
of the function g from its expectation, i.e., an estimate for 

P{|5-E5|>t} (1) 


for all n > 1 and all t > 0, within a certain class of probability measures P. For 
example, an exponential concentration inequality is obtained by estimating the 
expectation 


E 


gA(g-Eg) 


for any A G M, and using the exponential Ghebychev’s inequality. 

However, there are natural examples where the exponential concentration 
inequality does not hold (see the example of the low-temperature Ising model 
below). In that case we are interested in bounding moments of the form 


E [{g - EgfP] 


to control the probability ©• 

In this section, we use a combination of the classical martingale decompo¬ 
sition of g — Eg and maximal coupling to perform a further telescoping which 
is adequate for the dependent case. This will lead us to a “coupling matrix” 
depending on the realization a G A^. This matrix quantifies how “good” future 
symbols can be coupled if past symbols are given according to a. Typically, 
we have in mind applications to Gibbs random fields. In that framework, the 
elements of the coupling matrix can be controlled uniformly in a in the “high- 
temperature regime”. This uniform control leads naturally to an exponential 
concentration inequality. At low temperature we can only control the coupling 
matrix for “good” configurations, but not uniformly. Therefore an exponential 
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concentration inequality cannot hold (for all g). Instead we will obtain poly¬ 
nomial and stretched-exponential concentration inequalities. This will be done 
by controlling moments and Luxembourg norms of g — Kg. 


2.1 The coupling matrix 

We now present our method. For i = 1,2, ...,n, let J^i be the sigma-field 
generated by the random variables ai,... ,ai, and be the trivial sigma-field 
{0, n}. We write 

n 

g{ai,...,an)-Kg = J2'^ii^) (2) 

i=l 

where 

Viia) := E[g\Ti]{a) - K[g\J^i_i]{a) = 

j K{dgi+i ■ • -drynlcJi,... ,£7*) g{ai,.. ...,gn) 

- j ..., (Ti_i, r/j, r/j+i, ..., 77^) = 

j IP(dr?i+i ■ ■ ■ dgnWi, ■ ■ ■, <^i) g(cri, di+i, ■ ■ ■ ,Vn) 

- j F{dgi\ai,. . . J K{dgi+i ■ ■ ■ dgnWi, ■ ■ .,cri-i,gi) g{ai,.. ., crj-i, r/j, r/i+i, ...,gn)< 


max 1 
aGA J 

^ ¥{dgi+i ■ ■ 

• dgn\(yi,. 

■ ■ ,cri-i,cri = a) g((Ti,.. 

• 5 1 • • 

■,r]n) 

— min 
bGA J 

^ K{dgi+i • 

■■dgnWi, 

.. .,ai-i,ai = b) 5 (fTi,. 

• • ; O'i—l^ 6, ^2 + 1 , . . 





=-.Y,{a)-X,{a). 


(3) 


Denote by ^ = P^^ ^ the maximal coupling |12j of the conditional distri¬ 
butions P((ir7>i+i|(Ti,..., cJi_i, CTj = a) and P((ir7>i+i|(Ti,..., cJi_i, (jj = b), that is, 
the coupling for which Now we introduce the (infinite) upper-triangular matrix 
defined for i,j G N by 

D- ■= 1 


^i,i+3 ■= } • (4) 

Notice that if the cij’s are mutually independent, then is the identity matrix 
because the conditional distributions K{dg^i+i |f7 i,..., £7i_i, cjj = a) and 
P((ir7>i+i|(Ti,..., cJi_i, (Tj = h) are equal. Hence we have a perfect coupling in 
this case. 

We proceed with the following simple telescoping identity: 

g{ai ,..., (Ti_i, a, ..., a^'>) - g{ai,..., ai-i,b, cr|+\, ...,a^'>) = 

[g{ai,..., cJi-i, a, ..., - g{ai ,..., 6, crj^\,..., 0-^)] + 
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• • •, CTi-i, b, , cjW) - g{ai,.. 

1 . ( 2 ) ( 1 ) ( 1 ) 

)] + 


, cr*-i, b, ,..., ) -g{ai ,, 

U ( 2 ) ( 2 ) ( 1 ) 



+ ••• + 




L ( 2 ) ( 2 ) ( 2 ) (l)x 

. , 6 , CJi+ 2 , • • • Wn -1 WW) 

- g{ai,... ..., 



=: Vg+jS, 

i=o 

We define the variation of g at site i by 

5ig := sup \g{a) - g{a ')\, 

crj=a'. 

and by construction we have the inequality 

< Si+j9 ■ 

i+j'i+j 

It follows from m and that 

Yi{a)-Xi{a) = 


max \ / ¥{dgi+i ■ --dgnlcri, • • = a) g{ai,... ,ai-i,a,gi+i, ...,gn) 

a,beA I J 

- j P(dr?i+i • • • dgn\cri, ■ ■ • ,crj-i,o-j = b) g{cri ,..., 6, r/j+i,... ,7?n)| 

= max I f 

a,b£A K J ’ ’ 

[g{cri,ai-i,a, ..., - g{ai,ai-i, b, ..., | 

n—i 

^ ^ ^ 

’ j=0 


n—i 

3=0 

where 5g denotes the column vector with coordinates 6jg, for j = 1,... ,n, and 
0 for j > n. Therefore, we get the inequality 

Viia) = Y,{a)-X,{a)<{D‘^ 6 g)i. (5) 


Applying the above reasoning to —g shows that the previous inequality also 
applies to —Vi- 

Remark 1. The advantage of the previous bound is that it only involves 6g. 
One eould imagine to consider, for instanee, the second moment of Vj^_f_jg 
instead. This could lead to better results but it has the drawback that we need 
to know much more about the coupling than we usually do. 
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2.2 Uniform decay of D^: exponential concentration inequality 

Let Dij := sup^^^N We assume that the following operator £^(N)-norm is 
finite: 

:= sup < oo . (6) 

nGfe(N),||«|1^2(^)=l 

We have the following exponential concentration inequality. 

Theorem 1. Let n G N 6e arbitrary. Assume that holds. Then, for all 
functions g : M, we have the inequality 


’{Is -%| >t}< 2exp 



(7) 


for all t > 0. 

Proof. We recall the following lemma which is proved in [^. 

Lemma 1. Suppose T is a sigma-field and Zi,Z 2 ,V are random variables such 
that 

1. Z\ ^ IL < Z 2 

2. E(1/|U) = 0 

3. Z\ and Z 2 are iF-measurable. 

Then, for all A G M, we have 

( 8 ) 


We apply this lemma with V = Vi, T = Ti-\, 

Zi = Xi- ElglXi-i], Z 2 = Yi- E[ 5 r|Ui_i]. Using inequality © 

Vi{a) = Y,{a) - X,{a) < {D^6g)i 


we obtain 

E(e^^“|Ui_i)((j) < _ (9) 

Therefore, by successive conditioning, and the exponential Chebychev’s inequal¬ 
ity, 

F{g-Eg>t} < e-^^E 

< e“^*E (E{e^^" |U„_i)e^ 

< ... < e-^*exp (10) 

< e-^*exp . 
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Now choose the optimal A = At/ obtain 


2f 


P {g - > t} < exp -^ 2 II r IIS 

V \\6g\\; 


Combining the inequality for g and the one for —g yields 0. The theorem is 
proved. □ 


2.3 Non-uniform decay of D^: polynomial and stretched-exponential 
concentration inequalities 

If the dependence on a of the elements of the coupling matrix cannot be con¬ 
trolled uniformly, then in many cases we can still control the moments of the 
coupling matrix. To this aim, we introduce the (non-random, i.e., not depend¬ 
ing on a) matrices 

npiirf'” ( 11 ) 

for all p € N. 

A typical example of non-uniformity which we will encounter, for instance 
in the low-temperature Ising model, is an estimate of the following form: 

+ ( 12 ) 

where ipj > 0 does not depend on a, and where are unbounded functions of 
a with a distribution independent of i. The idea is that the matrix elements 
^ii+j “start to decay” when j > fi((T). The “good” configurations a are those 
for which fj (cr) is “small”. 

In the particular case when m holds, in principle one still can have an 
exponential concentration inequality provided one is able to bound 

However, in the example given below, the tail of the ii will be stretched expo¬ 
nential. Henceforth, we cannot deduce an exponential concentration inequality 
from these estimates. 

We now prove an inequality for the variance of g which is a generalization 
of an inequality derived in ^ in the i.i.d. case. 

Theorem 2. Let n € N be arbitrary. Then for all functions p : A” — > R we 
have the inequality 


E[{g-Egf] < • (13) 

Proof. We start again from the decomposition 0. Recall the fact that 
E[Vi|.Fj] = 0 for all i > j, from which it follows that E[ViI/;] = 0 for i ^ j. 
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Using © and Cauchy-Schwarz’s inequality we obtain 


□ 


n 

E [{g-¥.gf]=¥.Y,V? 

i=l 


< 


< 


< 


E Y.^D5g)j 


\ 2 = 1 


EEE E(A,fcA,z)45'^z5 

2=1 k=l /=! 


n n n ^ ^ 

^ ^ ^ E (1)2,) 2 E (1^5) 5 
i=l k=l 1=1 


Remark 2. In the i.i.d. case, the coupling matrix D is the identity matrix. 
Hence inequality Cl reduces to 

E[{g-Egf]<\\5g\\%^^^ 

which is the analogue of Theorem 4 in OF- 

We now turn to higher moment estimates. We have the following theorem 
from which we recover Theorem |2l but with a bigger constant. 

Theorem 3. Let n € N be arbitrary. For all functions g : M and for 

any p £N, we have 

E [{g - EgfP] < {20pfP 
Proof. We start from © and get 

E [{g - EgfP] = E 

Now, by m and since E[Vi\Fj) = 0 for i > j, g—Eg is a martingale, to which we 
apply Burkholder-Gundy’s inequality [HI formula II.2. 8 , p. 41]: for any q>2, 
we have 

(e^') 

\i=i 

Therefore, for q = 2p, p G N, this gives at once 
E [(5 - EgfP] < {2QpfP E 



E [\g — E^rj'^] 1 < 10(? E 







2p 








We now estimate the rhs by using ©: 

E [iY.yi'f] 

i 

i\ ip 

Y,---Y.^[iD6g)l---{D69)t 

i\ ip 

E E E E(nA„,,A„. 

h---ip jl --jp ki---kp \r=l 


< 


Yl ^jr9 ^kr9 


\r=l 


< 


E E E n 4.9 

il---tp jl---jp ki --kpr=l 


= S II4IIX 

where in the fourth step we used the inequality 


2p 


E(/i---/2p)<nWf))^ 


2=1 


(14) 

(15) 


which follows from Holder’s inequality. □ 

In order to be able to apply Theorems |2l and El one needs to estimate 

Proposition 1. Assume inequality m> holds, and let p £ N. We have the 
bound 

OO 

+ IIV’IIt(n) • 

i=i 

Proof. We start by an upper estimate of From the definition (lllj) and 

the bound m we have using Minkowski’s inequality (for j > i) 

vff = E[(A",)'^] < E((]l{£i(a) >j-i} + ^ < 


E 


{t{li{a) >j- + ipj-i < P(4(o') >j-i)^P+ 'ipj-i =: Ui-j , (16) 


since the law of ii is independent of i. 

Now take v G £^(N) with ||u||^ 2 (j,j = 1. We have 



OO 

E^ST’m 

< 

OO 

'^u{i - k)\vk\ 


k=l 

£^(fl) 

k=l 
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where the second inequality comes from (IT^ . Since we have the f^(N)-norm of 
a convolution, we can apply Young’s inequality (see, e.g., EH) to get 

The result immediately follows. □ 


Before we state the next theorem, which is a corollary of Proposition ^ and 
Theorem |21 we need the definition of some Orlicz spaces. We only deal here 
with a restricted class useful in our applications, we refer to nHnn for the 
general definition. For ^ > 0, let : M ^ M"*" be the Young function defined 
by 

where hg = ((1 — g )/<£»<!}. These are the Young functions used in 
particular in EH We recall that (see EH) the Luxembourg norm with respect 
to <l>g of a random variable Z is dehned by 



inf 


|a > 0 



Remark 3. Note that for <hp(x) = \xf’, the Luxembourg norm is nothing but 
the L^-norm. 


Theorem 4. Let n G N 5e arbitrary. Then, for all functions g : —>■ M, for 

any p G N and any e > 0, we have 

£[(<, - E;,)"*'] < 


i^Opf” (C(l + t/(2p - i))(2p-i)/2fE(<5>’+')'/"’’ + IIVII,.,,,)"'’ l|i9llX„, (17) 


where, as usual, ( denotes Riemann’s zeta function. For any tD > 0, there is a 
constant C§ > 0, such that for any £) < tl/(l + 1 ?) satisfying C('^?(l — q)/q) ^ 
we have 


g-'Kg 







(18) 


Remark 4. A similar result holds when — q)/q) < 1 with the square root 
of the zeta function. Note also that when g increases to '&/{! + 1 }), the number 
?9(1 — g)/g decreases to one. 

Proof. We hrst estimate in terms of some moment of Iq. Let 

e' = el{2p — 1). We have using Holder inequality 


i=i 


< 


< 


> ^y/2p .-[2p-l)(l+e')/2p 

i=i 


l/2p 


C(1 + £ j2p-i+T(4(a) > j) 


ii=i 


C(1 + e')(2p-i)/2p . 
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Using Proposition^ we get 

< C(1 + + ||V>|Li(«) , 

and (El) follows using Theorem El 

To prove m, we first observe that from ED we have, for q even 

< 10g(c(l + e/{q - l))"^E(Oi + . 

L‘I(P) 


g-% 

11*^511^2 (P, 


We now recall that for any 1 > > 0, there is a constant Cg > 1 such that 


Cg ^ sup■ 

q>2 




< 11^1 


< Cq sup■ 

q>2 


|r'3'(p) 


71 /e 


(See, e.g., (H] for a proof.) It is easy to verify using Young’s inequality that the 
same inequality holds (with slightly different constants) when the supremum is 
taken over the q even integers, and we will only consider such q below. 
Therefore if0<^)<'i?/(l+??), taking 


e 



1 


we get 


ll'^g|L2(fj) 



< 0(1) supgi-V^C(l + e/{q - l))^'?-!)/'? E(£g+^)V9 + 0(l)||V.||,i(„) 

q>2 

< o(i)supc(^h^)'’ ‘’^’iko||JT‘’’'' + o(i)II^II<‘i.) 

g>2 V ^ / 


since the function ( is decreasing. Thus m is proved. The proof of the theo¬ 
rem is now complete. □ 


It is easy to obtain from Theorem^the following concentration inequalities. 
Proposition 2. Let n be an arbitrary positive integer. 

• //E(/’q^^'^) < 00 (for some e> 0), and HV’lLpN) < 00 , we have 

11 (5g 11^1* 

P{|g-Eg|>t}<C,^^ (19) 

where Cp g]0, oo[, p gN, for any g : —> R. 

• Let 0 < g < 1. If liV’lLpN) < 00 , and po||$,> < 00 for some d > g/{l — g), 
there exists a constant g]0,oo[ such that 

P{|s-E,|>i}<4exp(^-c„,^^j . (20) 

for any g : A’^ —>■ R. 
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Proof. The proof of (jlhl) is an immediate consequence of CZI) applied to g 
and —g and Chebychev’s inequality. 

For the proof of CT- we have for any A > 0 using Chebychev’s inequality 


P( 5 r -Eg>t) = 


( ff-Eg 


> 




< 


< 




g - Eg 

1 


>^c 


d>g(t/(A||5g||,2(„))) 


E 




A||5g||,2, 

/ g - Eg 

V J _ 


We now take A = 
1. Thus we have 


||(g -Eg)/||(ig||,2„J|$ By definition, E 




.9—El? 

I Mm,2,, 


P(g - Eg > t) < 


^>g(t/||g -Eg||$J 


Of course, the same inequality holds with —g. Applying m yields pn|). The 
proposition is proved. □ 


In concrete applications of inequality P|) we have to check that Cp < oo, 
otherwise the inequality is useless. To apply (TO . we have to check that Cg > 0. 
We will give an example of application below. 

Inequality P|) is a “polynomial” concentration inequality whereas inequal¬ 
ity ((^ is a “stretched-exponential” concentration inequality. 

Remark 5. The 4 in the r.h.s. of (I2()|) is not optimal. It can be replaced by 
2/(1 — e) for any e g]0, 1[. 


2.4 Random fields 

We now present the extension of our previous results to random fields. This 
requires mainly notational changes. We work with lattice spin systems. The 
conhguration space is = {—,+}^ , endowed with the product topology. We 
could of course take any finite set A instead of {—, +}. For K dlA and a,r] ^ It 
we denote cr\r]\c the configuration coinciding with a (resp. g) on A (resp. A'’). 
A local function g : —>■ R is such that there exists a finite subset A C such 

that for all u, g,a;, g((TAWA=) = g(<7AgA'=)- 

For a ^ It and x G Z'^, denotes the configuration obtained from a by 
“flipping” the spin at x. We denote 5xg = sup^ |g(<7’^) — g(c’')| the variation of 
g at X. 6g denotes the map Z'^ —>• R : x i—!■ 6xg. 

We introduce the spiraling enumeration F ; Z'’* ^ N illustrated in the figure 
for the case 4 = 2. 

We will use the abbreviation (< x) = {g G Z'’* : r(g) < r(x)} and similarly 
we introduce the abbreviations (< x). By definition IF<x denotes the sigma-held 
generated by cr{y), y < x and denotes the trivial sigma-held. 


12 



























































7 :8 9 

10 











.. 6 . 

i 2 













n 













543 



























































For any local function 5 : fl —> M, we have the analog decomposition as in 

©: 

9-^9= (21) 

xeJA 

where 

14 := E [g\J^<x] - ^[g\J^<x] ■ 

The analog of the coupling matrix is the following matrix indexed by lattice 
sites X, y G 

Z)J,,:=P^,+,_{Xi(y)/X2(y)} (22) 

where PJ + _ denotes the maximal coupling between the conditional measures 
P(-|cT<a;^+^) and P('|iT<a;,-^). The notation “+a;” (resp. x”) means that at 
coordinate x in the configuration we put a “+” (resp. a 

We first consider the case of uniform decay of D. In that case, the expo¬ 
nential concentration inequality of Theorem ^ holds with the norm of (.2 (^‘^); 
i-e., ¥g\\l = Ex^z<^ {6xg)‘^ (which is trivially finite since s' is a local function). 

Theorem 5. Assume that 


:= sup D'^ y (23) 

a 

is a bounded operator in Then for all loeal functions g we have the 

following inequality 


'{\g-Eg\ > t} < 2exp 


2V 




(24) 


for all t > 0. 

In the non-uniform case, Theorems |210 and Proposition [21 extend immedi¬ 
ately as follows. The analog of m is 


Df y < ]1{ £x{cr) >\y- x\} -h ^p{\y - x|) . (25) 
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From now on, we assume that the distribution of ix is independent of x. We 
extend the matrix T> dehned in m by putting 


Dg := nm.,)’]''” 


for X, y G 

Theorem 6. For any local function g and for any p G N, we have 

E[(9-E9)">’] <(20p)'’ 

Theorem 7. For any local function g, for any p G N and any e > 0, we have 

E[{g-EgfP] < 


{20pfP (C(l + e/{2p - l))(2p-i)/2PiE(^2pd+.^i/2p ^ mX,,, (26) 

where, as usual, C denotes Riemann’s zeta function. For any > 0, there is a 
constant C§ > 0, such that for any g <’&/{! + i}) satisfying C(^(l — q)/q) ^ 
we have 

< C„ (c ■ (27) 

Proposition 3. For any local function g we have the inequalities: 

• //< oo (for some e > 0), and ||'!/’||£ 1 (n) < oo, we have 


g-% 


]P{|g - Eg| >t}<C.^ 
where Cp g]0, oo[, p G N. 


P i2p 


Let 0 < ^ < 1. If liV'llp(N) < oo, and Po||<i.,, < oo for some D > q/{1 — q), 
there exists a constant Cg^u g]0,oo[ such that 


t^ 


’{\g-Eg\ >t} <4exp I -p^ 


Remark 6. It is immediate to extend the previous inequalities to integrable 
functions g belonging to the closure of the set of local functions with the norm 

lllglll := ¥9\\E^aL- 
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2.5 Existence of the coupling by bounding the variation 

We continue with random fields and state a proposition which says that if we 
have an estimate of the form 


K. < {D6g)^ 

for some matrix D, then there exists a coupling with coupling matrix D such 
that its matrix elements decay at least as fast as the matrix elements of D. We 
formulate the proposition more abstractly: 

Proposition 4. Suppose that P and Q are probability measures on and 
O' : n —> M such that we have the estimate 

|Ep[5] -E(Qb]| < ^ p{x)6^g (28) 


for some “weights” p : ^ M'*'. Suppose ^ M'*' is such that 

^ p{x)p{x) < oo. 

Then there exists a coupling p of ¥ and Q such that 

jl {Xi{x) / X 2 {x)} p{x) < Y t{x)p{x) < oo . 

Proof. Let := [— n, nY n Dehne the “cost” function 

C'n(o'w') := Y t{x). 

x£B„ 

Denote by P^, resp. Qn, the joint distribution of {crx,x G Bn} under P, resp. 
Q. Consider the class of functions 

:= {gl \g{cr) - g{a')\ < Y T{x)t{ax^ ^9.} . 

xO.Z'^ 

It is obvious from the definition that g G Gc!^, if, and only if, g is Xb^- 
measurable and 

{Sxg)icr) < <p(x) Vx G VfJ G D. 

Therefore, if (t^ holds, then for all g G Gq^, 

|Ep[c/] -Eq[c/]| < Y pi^Yxg < Y p^^)t{x) ■ 

Hence, by the Kantorovich-Rubinstein duality theorem nzi, there exists a cou¬ 
pling pn of Pn and Q„ such that 

lE/in (C'n(o-wO) [ y] t{x)HXi{x) /X2(x)} J < Y t{x)p{x). 

\x&B„ J x&P 
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By compactness (in the weak topology), there exists a subsequence along which 
jin converges weakly to some probability measure jl. For any k < n, we have 


E ^ix)nxiix) ^ X2ix)} I < 

\xeBk ) 

( E ^{x)t{Xi{x) /X 2 (x)} j < ^ Lp{x)p{x). 
\xeBn ) xeTA 

Therefore, taking the limit n —> oo along the above subsequence yields 


E l^{x)t{Xi{x) /X 2 (x)} < ^ ^{x)p{x). 
\xeBk J x&'i 

We now take the limit /c —> oo and use monotonicity to conclude that 


Er, I E ^ix)HXiix) 

c£'E‘^ 


y^X2(x)} 


< E ■ 


□ 

We shall illustrate below this proposition with the example of Gibbs random 
fields at high-temperature under the Dobrushin uniqueness condition. 

3 Examples 

3.1 High-temperature Gibbs measures 

For the sake of convenience, we briefly recall a few facts about Gibbs measures. 
We refer to [S] for details. 

A finite-range potential (with range R) is a family of functions U (A, a) 
indexed by finite subsets A of such that the value of U{A, a) depends only 
on a A and such that U{A, cr) = 0 if diam(A) > R. If i? = 1 then the potential 
is nearest-neighbor. 

The associated finite-volume Hamiltonian with boundary condition rj is then 
given by 

^a(^) = E U{A,aApA<^)- 

AnA^iD 

The specification is then defined as 


7A(o-|r?) 




We then say that P is Gibbs measure with potential U if 7 a(o'|') is a version of 
the conditional probability P(ita|.^a=)- 
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Before we state our result, we need some notions from [7]. What we mean by 
“high temperature” will be an estimate on the variation of single-site conditional 
probabilities, which will imply a uniform estimate for disagreement percolation. 
For y G let 


Py ■■= 


2 sup 


(O'y — + 


-n< 



Writing p for {py)y, let denote the Bernoulli measure on with 

i'p{{X{y) = -|-}) = Py, and its single-site marginal. 

From [7| Theorem 7.1] it follows that there exists a coupling PJ _ of the 
conditional distributions P('|(T<a;,and P(-|(T< 3 ;, —a;) such that under this 
coupling 

1. For y > X, the event Xi{y) ^ X 2 {y) coincides with the event that there 
exists a path 7 C \ (< x) from x io y such that, for all 2 ; G 7 , 
Xi{z) 7 ^ X 2 {z). We denote this event by “x-<^y”. 

2. The distribution of ]l{Xi(y) / X 2 {y)} for y G Z*^ \ (< x) under is 

dominated by the product measure 

n ’^Py ■ 


Let Pc = Pc{d) be the critical percolation threshold for site-percolation on Z'^. 
It then follows from statements 1 & 2 above that, if 

sup{py :y <pc (29) 

then we have the uniform estimate 

K^+,_{X,{y)^X2{y)}< n (30) 

Then we can apply Theorem [3 to obtain 

Theorem 8. Let U be a nearest-neighbor potential such that © holds. Then 
for the coupling matrix we have the uniform estimate 

for some C > 0. Hence we have the following exponential concentration in¬ 
equality: for any local function g and for all t > 0 


P {|5 -%l >t}< 2 exp -- 


2H 




P5II; 


Remark 7. Theorem\^can easily be extended to any finite-range potential. 
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Theorem [H] was obtained in [ini in the Dobrushin’s uniqneness regime [HI 
Chapter 8] nsing a different approach. The high-temperature condition which 
we use here is sometimes less restrictive than Dobrushin’s uniqueness condition, 
but sometimes it is more restrictive. However, Dobrushin’s uniqueness condi¬ 
tion is not limited to finite-range potentials. We now apply Proposition |1] to 
show that in the Dobrushin’s uniqueness regime, there does exist a coupling of 
IF’(’|o'<x,-i-^) and P(-|cr<a;,-„,) such that the elements of the associated coupling 
matrix decay at least as fast as the elements of the Dobrushin’s matrix. The 
Dobrushin’s uniqueness condition is based on the matrix 


a 


^,y 


2 sup 




This condition is defined by requiring that 

sup ^ Cx,y < 1 

and the Dobrushin matrix is then defined as 

^ ^X,y ■ 
n>0 

We now have the following proposition: 

Proposition 5. Assume that the Dobrushin uniqueness condition holds. For 
any (^ : Z'’* —> M"'' such that for any x G Z'^, 

^ ip{y)Ay^^ < oo . 
yeZ’i 


Then there exists a coupling Ff. _ o/P(-|(T<a;^+„,) and P(-|cr<a;,-„,) such that 

Y1 feV {^i(d) / ^2(y)} < oo. 

Proof. From jlOL Lemma 1], we have the estimate 
J g{r]) F{dr]\a<^,+J - j g{r]) F{dr]\a<^-^) 

(where 1x,y denotes the Kronecker symbol). 

We can apply Proposition |1] to conclude the proof. □ 


— ^ ^ i^x,y T ^y,x)dyg 


As an example we mention that if the potential is finite-range and translation- 
invariant and satisfies the Dobrushin uniqueness condition, we have for large 
enough \x — y\ 

Ay,x < 

and hence there exists a coupling P^^ _ such that 

r,X-{Xiiy) ^ X2{y)} < 

for all d < c and large enough \x — y\. 

Unfortunately, we are not able to construct explicitly such a coupling. 
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3.2 The low-temperature Ising model 

It is clear that for the Ising model in the phase coexistence region, no expo¬ 
nential concentration inequalities can hold. Indeed, this would contradict the 
surface-order large deviation bounds for the magnetization in that regime (see 
e.g. |H1 and references therein). Nevertheless, we shall show that we can control 
all moments and obtain stretched-exponential inequalities (which are compati¬ 
ble with large deviation bounds). 

We consider the low-temperature plus phase of the Ising model on Z'^, d> 2. 
This is a probability measure on lattice spin configurations u G H, defined 
as the weak limit as A | Z'^ of the following finite-volume measures: 

= exp j /3 + 

y <xy>£A <xy>,x(^dA, y^A j 

where /3 G is the inverse temperature, and is the partition function. In 
m < xy > denotes nearest neighbor bonds and dA the inner boundary, i.e. 
the set of those x G A having at least one neighbor y ^ A. The existence of the 
limit A I Z'^ of P)^ ^ is by a standard and well-known monotonicity argument, 
see e.g. [S]. 

For any 77 G H , A C Z*^ we denote by P)( ^ the corresponding finite-volume 
measure with boundary condition rj: 

^a,/ 3 ('^a) = exp /3 ^ a^cTy + P ^ a^^rix 

y <xy>£A xgA, y^A 

Later on we will have to choose /3 large enough, in particular, greater than the 
critical inverse temperature Pc {P < Pc implies uniqueness of the infinite-volume 
measure). 

We can now formulate our results on arbitrary local functions for the low- 
temperature Ising model. 

Theorem 9. Let P = P^ he the plus phase of the low-temperature Ising model 
defined above. There exists Po > Pc, such that for all P > Pq, for any local 
function g, we have the following inequalities: 

• For all p G N, there exists a constant Cp g]0, oo[ such that 

E[{g-EgfP] < Cp \\dg\Q^^,^. 

Consequently, for all t > 0, we have the concentration inequalities 
F{\g-Eg\>t}<Cp J/^ ■ 

• Moreover, there exists 0 < q{P) < 1, such that for any 0 < g < q{P) there 
is a constant Kg > 0, such that we have, for any local function g, 

< Kg \\5g\\fl^^dy 
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Consequently, there exists a constant Cg g]0,oo[ such that, for all t > 0, 


Pjl^f — E( 7 | > t} < 4exp 

Proof. This theorem is an application of Theorem 0 and Proposition 121 
All we have to do is to obtain the bound (1251) with good decay properties for 
the tail of the distribution of io to ensure the finiteness of , Po||<i>^, 

and of This is an immediate consequence of the next proposition. □ 




Ising 
holds 

for all n €N and 

P{4 >n}< C'e-<^'^" 
for some c, d, C, C" > 0 and 0 < a < 1. 


Proposition 6. Let P = P^ he the plus phase of the low-temperature 
model. There exists (3o > Pc such that for all P > Po, the inequality (EH) 
together with the estimate 

Pin) < Ce-^^ 


Proof. We shall make a coupling of the conditional measures P(-|(T<a;,+^) 
and P('|iT<a;,-^). This coupling already appeared in EH (see also [Tj)- Both 
conditional measures are a distribution of a random field cvy, y pi (< x). We 
start with the first site y\> x according to the order induced by T (see Section 
UM- We generate Xiiyi) and ^ 2 ( 2 / 1 ) as a realization of the maximal coupling 
between P(iTj^i = ■\a^x,+P) and P(o'y^ = •|cj<a;,-^). Given that we have gener¬ 
ated Xiiy),X2iy), ... ,Xiiyn),X2{yn) for y = yi,...,yn, we generate Xiiyn+i), 
X2iyn+i) for the smallest yn+i > yn as a realization of the maximal coupling 
between 


P(o-y,+i = -iXiiyi) ■ ■ ■ Xiiyn)a<x,+J and P(o-j/„+i = ■\X2{yi) ■ ■ ■ X2iyn)<y<x,-J ■ 

By the Markov property of P we have the following: if there exists a contour 
separating y from x such that for all sites 2; belonging to that contour we have 
Xpz) = X2{z), then Xiiy) = X2iy). The complement of this event (of having 
such a contour) is contained in the event that there exists a path of disagreement 
from x to y, i.e., a path 7 C \ (< x) such that for all 2 G 7, ^i(^) p -^2(2). 
Denote that event by E^y. Clearly its probability is bounded from above by the 
probability of the same event in the product coupling. In turn the event E^y is 
contained in the event Epy that there exists a path 7 from x to y in Z'^ \ (< x) 
such that for all 2 G 7, (Xiiz), X2iz)) p (+,+). In m the probability of that 
event in the product coupling is precisely estimated from above by 

Ce-^\--y\ + l{ixia)>\x-y\} ( 32 ) 

for some C, c > 0, where ixi(^) is an unbounded function of a with tail estimate 
P( 4 ((t) >n) = P( 4 (tT) > n) < 
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for some C',c! > 0 and 0 < a < 1. For the reader’s convenience, we briefly 
comment on these estimates. The idea is that the conditional measure P(-|^< 2 ;) 
resembles the original unconditioned plus phase (in \ (< x)) provided ^ 
contains “enough” pluses. “Containing enough pluses” is exactly quantified by 
the random variable is the event that for all self-avoiding 

path 7 of length at least re, the magnetization along 7, 

' 'I ZG7 

is close “enough to one”. If this is the case then under the conditional measure 
we still have a Peierls’ estimate, which produces the exponential term in (El. 
We refer to m for more details. □ 
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